Multiple-Goal Reinforcement Learning with Modular Sarsa(0)
نویسندگان
چکیده
We present a new algorithm, GM-Sarsa(0), for finding approximate solutions to multiple-goal reinforcement learning problems that are modeled as composite Markov decision processes. According to our formulation different sub-goals are modeled as MDPs that are coupled by the requirement that they share actions. Existing reinforcement learning algorithms address similar problem formulations by first finding optimal policies for the component MDPs, and then merging these into a policy for the composite task. The problem with such methods is that policies that are optimized separately may or may not perform well when they are merged into a composite solution. Instead of searching for optimal policies for the component MDPs in isolation, our approach finds good policies in the context of the composite task. This material is based upon work supported by a grant from the Department of Education under grant number P200A000306, a grant from the National Institutes of Health under grant number 5P41RR09283 and a grant from the National Science Foundation under grant number E1A-0080124.
منابع مشابه
Multiple-Goal Reinforcement Learning with Modular Sarsa(O)
We present a new algorithm, GM-Sarsa(O), for finding approximate solutions to multiple-goal reinforcement learning problems that are modeled as composite Markov decision processes. According to our formulation different sub-goals are modeled as MDPs that are coupled by the requirement that they share actions. Existing reinforcement learning algorithms address similar problem formulations by fir...
متن کاملGlobal Policy Construction in Modular Reinforcement Learning
We propose a modular reinforcement learning algorithm which decomposes a Markov decision process into independent modules. Each module is trained using Sarsa(λ). We introduce three algorithms for forming global policy from modules policies, and demonstrate our results using a 2D grid world.
متن کاملCar Simulation Using Reinforcement Learning
This project report presents the result of Reinforcement Learning (RL) experiments in a car simulation. W ithout any knowledge of the tracks in advance, the car can be trained to avoid bumping into the walls by learning from the given rewards. We have built a car simulation system in which the car can be trained and tested on the tracks with several RL algorithms , including Actor-Critic method...
متن کاملارائه الگوریتم جدید Fuzzy SARSA بهمنظور پیش بینی نوسانات سطح قند خون بیماران مبتلا به دیابت نوع یک
Background: One of the serious complications of type 1 diabetes is a sudden increase and drop in blood glucose levels causing risks of anesthesia and coma. Thus, an important step towards the optimal control of the disease is to use intelligent methods with low error rate and available information in order to predict and prevent such complications. In this paper, a combined Fuzzy SARSA algorith...
متن کاملFuzzy Sarsa: An approach to linear function approximation in reinforcement learning
This paper investigates two different approaches to learning using an agent electronic marketplace as test bed. The types of learning considered in this paper include the temporal difference (TD) learning algorithm Sarsa, and two new fuzzified versions of this algorithm, FQ Sarsa and Fuzzy Sarsa. We implement the three learning algorithms in an agent test bed in order to determine their usefuln...
متن کامل